AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.34)

Neural Information Processing Systems, Jan-20-2025, 13:04:24 GMT

Reviews: The Multiscale Laplacian Graph Kernel

My main concerns relate to the missing intuition and unclear explanations in some parts of the paper, and to the need for more detail in the experimental section. More specific comments are given below. Overall, I would consider the paper to be based on a novel idea. References: - Reference [10] is poorly written: rather than "jmlr", a clear statement as in reference [5] would be preferable. Section 2: - It would be nice to clarify the notation, in particular regarding graphs, before describing the method.

explanation, multiple different scale, multiscale laplacian graph kernel, (6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.70)

Neural Information Processing Systems, Oct-8-2024, 05:56:37 GMT

Reviews: On the Complexity of Learning Neural Networks

The paper presents a new information-theoretic lower bound for learning neural networks. In particular, it gives an exponential statistical query (SQ) lower bound that applies even to neural nets with only one hidden layer using common activation functions and log-concave input distributions. By assuming log-concavity for the input distribution, the paper proves a lower bound for a more realistic setting than previous works, which applied to discrete distributions. To prove this SQ lower bound, the paper extends the notion of statistical dimension to regression problems and proves that a family of functions that can be represented by neural nets with one hidden layer has exponentially high statistical dimension with respect to any log-concave distribution. It is not clear to me whether there are any novel technical ideas in the proofs, but the idea of studying the SQ complexity of neural networks in order to obtain lower bounds for a more realistic class of nets is (as far as I can tell) novel and clever.

complexity, learning neural network, statistical dimension, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing Systems, Oct-7-2024, 22:26:04 GMT

Reviews: Data-Efficient Reinforcement Learning in Continuous State-Action Gaussian-POMDPs

This paper describes an extension to the PILCO algorithm (Probabilistic Inference and Learning for COntrol, a data-efficient reinforcement learning algorithm). The proposed algorithm applies a measurement filtering algorithm during the actual experiment and explicitly takes this filtering into account during the policy learning step, which uses data from the experiment. This is an important practical extension addressing the fact that measurements are often very noisy. My intuitive explanation is that the proposed approach makes the overall feedback system more "repeatable" (noise is mostly filtered out) and therefore learning is faster (given that the filtering is effective, see last sentence of the conclusion). The paper presents detailed mathematical derivations and strong simulation results that highlight the properties of the proposed algorithm.

algorithm, continuous state-action gaussian-pomdp, data-efficient reinforcement learning, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.40)

#artificialintelligence, Feb-19-2023, 02:15:10 GMT

Intuitive Explanation of Straight-Through Estimators with PyTorch Implementation

Sometimes we want to put a threshold function at the output of a layer. This can be for a variety of reasons. One of them is that we want to summarize the activations into binary values. This binarization of activations can be useful in autoencoders. However, thresholding poses a problem during backpropagation. The derivative of a threshold function is zero almost everywhere, so no gradient reaches the layers before it.
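The zero-gradient problem, and the straight-through workaround named in the title, can be illustrated with a small framework-free sketch. This is not the article's PyTorch implementation: the function names (`hard_threshold`, `loss_and_grads`, `train`), the 0.5 cutoff, the squared-error loss, and the single-weight model are all assumptions made for illustration. The sketch assumes the common form of the estimator, in which the backward pass treats the threshold as if it were the identity.

```python
def hard_threshold(z, cutoff=0.5):
    """Forward pass: binarize the pre-activation to 0.0 or 1.0."""
    return 1.0 if z > cutoff else 0.0


def loss_and_grads(w, x, target):
    """One-weight model: z = w * x, y = hard_threshold(z), loss = (y - target)^2.

    Returns the loss plus two candidate gradients with respect to w:
    the exact one (zero, because the threshold is flat) and the
    straight-through one (threshold treated as identity when going backward).
    """
    z = w * x
    y = hard_threshold(z)
    loss = (y - target) ** 2
    dloss_dy = 2.0 * (y - target)
    true_grad = dloss_dy * 0.0 * x  # d(threshold)/dz = 0 almost everywhere
    ste_grad = dloss_dy * 1.0 * x   # straight-through: pretend d(threshold)/dz = 1
    return loss, true_grad, ste_grad


def train(w, x, target, lr=0.1, steps=5, use_ste=True):
    """Plain gradient descent on w using either gradient."""
    for _ in range(steps):
        _, true_grad, ste_grad = loss_and_grads(w, x, target)
        w -= lr * (ste_grad if use_ste else true_grad)
    loss, _, _ = loss_and_grads(w, x, target)
    return w, loss
```

With the exact gradient the weight never moves, because every update is multiplied by zero. With the straight-through gradient the weight is pushed in the direction that would reduce the loss if the threshold were not there, which in this toy setup is enough to carry the pre-activation across the cutoff.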

autoencoder, gradient, straight-through estimator, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.43)

#artificialintelligence, Feb-13-2023, 13:15:45 GMT

An Intuitive Explanation of Policy Gradient – Towards AI

Originally published on Towards AI.

intuitive explanation

Technology: Information Technology > Artificial Intelligence (1.00)

#artificialintelligence, Sep-2-2020, 09:50:21 GMT

Intuitive Explanation of Differentiable Architecture Search (DARTS)

This paper, which came out in mid-2018, addresses the problem of scalability in searching for a network architecture. It belongs to the line of work on Neural Architecture Search, or NAS for short. As the name suggests, the idea behind this field is to explore how we can automatically search for deep learning model architectures. Currently, most data science problems are solved by manually designing a model architecture that gives "state of the art" results on a given dataset. The problem with this approach is that, although these architectures perform really well on the standard datasets, they don't perform as expected on organisation-specific datasets.

artificial intelligence, deep learning, machine learning, (5 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.65)

#artificialintelligence, Dec-20-2019, 06:03:50 GMT

XGBoost: An Intuitive Explanation

We all know how XGBoost dominates in Kaggle competitions due to its performance and speed. This blog is about understanding how XGBoost works (it tries to explain the research paper). This blog is not about how to code/implement XGBoost or how to tune its hyperparameters. XGBoost stands for eXtreme Gradient Boosting. The blog also explains bagging (bootstrap aggregating) and boosting (Adaptive Boosting).

algorithm, intuitive explanation, xgboost, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (1.00)

Luss, Ronny, Chen, Pin-Yu, Dhurandhar, Amit, Sattigeri, Prasanna, Shanmugam, Karthikeyan, Tu, Chun-Chen

Generating Contrastive Explanations with Monotonic Attribute Functions

arXiv.org Machine Learning, May-29-2019

Explaining decisions of deep neural networks is a hot research topic with applications in medical imaging, video surveillance, and self-driving cars. Many methods have been proposed in the literature to explain these decisions by identifying the relevance of different pixels. In this paper, we propose a method that can generate contrastive explanations for such data where we not only highlight aspects that are in themselves sufficient to justify the classification by the deep model, but also new aspects which if added will change the classification. One of our key contributions is how we define "addition" for such rich data in a formal yet humanly interpretable way that leads to meaningful results. This was one of the open questions laid out in Dhurandhar et al. (2018) [5], which proposed a general framework for creating (local) contrastive explanations for deep models. We showcase the efficacy of our approach on CelebA and Fashion-MNIST in creating intuitive explanations that are also quantitatively superior compared with other state-of-the-art interpretability methods.

artificial intelligence, explanation, machine learning, (20 more...)

arXiv.org Machine Learning

1905.12698

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

#artificialintelligence, Jul-12-2017, 21:40:08 GMT

An Intuitive Explanation of Convolutional Neural Networks

What are Convolutional Neural Networks and why are they important? Convolutional Neural Networks (ConvNets or CNNs) are a category of Neural Networks that have proven very effective in areas such as image recognition and classification. ConvNets have been successful in identifying faces, objects and traffic signs apart from powering vision in robots and self driving cars. In Figure 1 above, a ConvNet is able to recognize scenes and the system is able to suggest relevant tags such as 'bridge', 'railway' and 'tennis' while Figure 2 shows an example of ConvNets being used for recognizing everyday objects, humans and animals. Lately, ConvNets have been effective in several Natural Language Processing tasks (such as sentence classification) as well. ConvNets, therefore, are an important tool for most machine learning practitioners today.

artificial intelligence, machine learning, matrix, (16 more...)

Industry: Transportation > Ground (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)